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Abstract 

We present an analysis of the spectral density of the adjacency matrix of large random trees. We show 
that there is an infinity of delta peaks at all real numbers which are eigenvalues of finite trees. By exact 
enumerations and Monte-Carlo simulations, we have numerical estimations of the heights of peaks. In the 
large tree limit, the sum of their heights is 0.19173 ± 0.00005. Moreover all associated eigenvectors are 
strictly localized on a finite number of nodes. The rest of the spectral density is a function which vanishes 
at all positions of peaks, which are a dense subset of real numbers: so this function is almost everywhere 
discontinuous. 

Keywords: random tree, spectral density, density of states, adjacency matrix, localization, delta peak. 



1 Introduction 

Many models of graphs have been investigated in the last 
decades. This interest is mainly motivated by their numer- 
ous areas of applications in physics (propagation and per- 
colation in disordered media, quantum gravity, etc.) and 
in other branches of science: combinatorial optimization, 
queuing theory, computers networks, interactions between 
biological molecules, quantum chemistry and many oth- 
ers. But their study by methods and concepts of statistical 
mechanics is more recent. The point of view of statistical 
physicists is generally different from the one of mathemati- 
cians, in particular by studying averaged quantities in the 
"thermodynamic" limit of infinite graphs rather than spe- 
cific quantities to a given finite graph. Moreover some re- 
sults have been obtained by Monte-Carlo methods, which 
are often regarded as "heretical" by mathematicians who 
prefer exact and proved results. 

We can describe a graph as a set of n nodes (or vertices) 
with interactions between pairs of nodes, represented by a 
nxn matrix. Then we are naturally interested by the spec- 
trum of this adjacency matrix. See for example the book 
[Cvetkovic and al., 1995] for the theory and applications 
of these spectra. Many works have been done about the 
singularities of the spectral density and their connections 
with the localization of eigenvectors, also called quantum 
percolation [Evangelou, 1983; Mirlin, 2000; Bauer and Go- 
linelli, 2001; id. 2001a]. The spectrum of the Laplacian 



matrix has also been studied [Biroli and Monasson, 1999]. 

In particular, it is known [Kirkpatrick and Eggarter, 
1972] that strictly localized eigenvectors (i.e. with a fi- 
nite number of non-zero coordinates) can appear on finite 
parts of large connected graph. Moreover the associated 
eigenvalues contribute to delta peaks in the spectral den- 
sity. But the strict localization can occur only on parts of 
graph with special patterns. The motivation of our work 
is to study quantitatively this phenomenon for large (i.e 
in the infinite size limit) connected graphs. 

As we have no reasons to choose a particular graph, we 
turn to models of random graphs. Of course, quantita- 
tive results depend on the particular choice of the model. 
Our first idea is to study the famous Erdos-Renyi model 
of random graph [Erdos and Renyi, 1960] but it presents a 
default: a graph on n nodes consists (in the high connec- 
tivity phase) of a giant connected component with 0(n) 
nodes on which strict localization can occur, plus a number 
0(n) of finite connected components on which eigenvec- 
tors are always strictly localized. As we are not interested 
by these ones, we must consider either a modified Erdos- 
Renyi model by keeping only the giant component, or an 
other model. 

We prefer the random labeled tree model described in 
Sect. 2, motivated by the following considerations. Firstly 
large Erdos-Renyi graphs on n nodes have circuits (or 
loops) with length O(lnn), so they have locally a tree 
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shape. As the strict localization occurs statistically on 
small number of nodes, it is little sensitive to circuits. 
Furthermore by definition a tree is always connected and 
we dismiss the problems due to small components. And 
finally a random tree is simpler to generate than other 
models of connected graphs. In this work, the interaction 
matrix is the adjacency matrix of the tree: it is the Hamil- 
tonian of a particle which hops at each time step from one 
node to a connected node. Remark that it is different to 
the Laplacian matrix which describes the diffusion of a 
particle with continuous time. 

As analytical methods give only partial results, we use 
two numerical methods: exact enumeration of small trees, 
described in Sect. 3.1 and Monte-Carlo simulations for 
large trees, described in Sect. 3.2. Qualitative descrip- 
tion of the spectral density is done in Sect. 4; existence 
of an infinity of delta peaks is proved in Sect. 5.1 and 
their statistics is given in Sect. 5.2. Appendix A contains 
details about the use of symmetry for the numerical com- 
putation of the spectra. In Appendix B, we show that the 
set of eigenvalues of finite trees are a dense subset of the 
real numbers. Finally Appendix C explains the difficul- 
ties encountered to extend analytical results obtained in a 
previous work [Bauer and Golinelli, 2000]. 

2 Definitions and generalities 

In this article, unless otherwise stated, the term tree refers 
to a labeled tree. A (labeled) tree on n nodes is a connected 
graph with nodes (or vertex) labeled {1,2, ... ,n} linked 
by n — 1 simple (i.e. undirected, loopless and not multiple) 
edges. Consequently a tree is without circuit (or polygon). 
Two nodes are called adjacent or neighboring if they are 
connected by an edge. A leaf is a node with only one 
neighbor. Note that a tree is a bipartite graph: the set of 
nodes can be partitioned into two subsets so that adjacent 
nodes arc in different subsets. 

A theorem due to Cayley says that the number of dif- 
ferent labeled trees on n nodes is 

T(n) = n n - 2 . (1) 

A simple proof uses the Priifer coding explained in 
Sect. 3.2. See the book [Van Lint and Wilson, 1992] for a 
general presentation. 

The adjacency matrix of any tree on n nodes is the n x n 
square matrix A such that Aij = 1 if nodes i and j are 
adjacent and otherwise. Then A is symmetric, with 
zeroes on the diagonal. The walks on a tree are counted 
by its adjacency matrix: the number of walks of length 
k starting at node i and finishing at node j is (A k )ij. 
Similarly, TrA k is the number of closed walks of length k. 

The spectrum (set of eigenvalues) of the adjacency ma- 
trix A of a tree T is more simply called the spectrum ofT. 
By definition, A is an eigenvalue associated with the eigen- 
vector V = (Vi, V 2 , . . . , V n ) if AV = XV. So the eigenvalue 



equation on the node i is 

W = £ Vj, (2) 

j adj i 

where the sum runs over the nodes j adjacent to i. As 
A is symmetric, its spectrum consists of real eigenvalues 
with a complete orthogonal basis of real eigenvectors. Fur- 
thermore, the spectrum is symmetric with respect to zero 
because a tree is bipartite: each eigenmode (A, V) has a 
partner (—A, V), where the vectors V and V are equal on 
the nodes of one subset and opposite on the nodes of the 
other subset. See Appendix A for an application of this 
property to the numerical computation of the spectrum. 

The symmetry factor of any tree is the number of per- 
mutations of the nodes that leave invariant this tree. Two 
trees are isomorphic if they differ only by a permutation 
of their nodes, or equivalently if their adjacency matrices 
differ only by a permutation of rows and columns. The 
number of trees isomorphic to any tree on n nodes with 
symmetry factor S is nl/S. The term labeled emphasizes 
that we are not identifying isomorphic trees. Therefore an 
unlabeled tree is an isomorphism class of labeled trees. 

Clearly, isomorphic trees are cospectral (i.e. they have 
the same spectrum) , but the reciprocal is wrong: the pro- 
portion of trees on n nodes which have a cospectral but 
non isomorphic partner goes to 1 when n becomes large 
[Cvetkovic and al., 1995, ch. 6]. 

The trees on n < 5 nodes (and their spectra) are listed 
in Table 1. A larger table for n < 10 is in [Cvetkovic and 
al., 1995]. 

In order to speak about average properties of trees (for 
example their spectral density), we turn the set of T(n) 
trees on n nodes into probability space with cquiprobable 
elements, called random (labeled) trees. In other words, 
a random tree on n nodes is randomly chosen among the 
T(n) trees. 

Note that the models of random labeled trees and of ran- 
dom unlabeled trees are not equivalent: trees with a high 
symmetry factor are less probable with the labeled model. 
For example, we have numerically computed that the av- 
erage fraction of the spectrum occupied by the eigenvalue 
in tree on n = 23 nodes is 0.1437. . . for a labeled tree 
and 0.2329. . . for an unlabeled tree. By considering the 
convergence, the size n = 23 is sufficiently large to so 
that a gap exists in the large n limit. It can be explained 
because most of eigenvectors with A = have only two 
non-zero coordinates which are opposite and localized on 
two leaves adjacent to a same node. As this pattern gives 
a symmetry factor 2, eigenvalues zero are less frequent in 
the labeled model. 

3 Numerical methods 

We use two numerical methods: full enumeration for small 
trees and Monte-Carlo simulations for large trees. 
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T 5 ,2 

T 53 • • • • • 

Table 1: Spectra of trees onn < 5 nodes. The column 



tree 



T 3 



T4,2 



• — • — • — • 



n\/S 



4 
12 



5 

60 
60 



spectrum 



{0} 



{-1, 1} 



{-V2, 0, V2} 



{-V3, 0, 0, V3} 
{±(V5-l)/2, ±(V5 + l)/2} 



{-2, 0, 0, 0, 2} 



{0, ±y/2- y/2, ±V2 + V2} 
{-V3, -1, 0, 1, V3} 
n\/S gives the number of labeled trees isomorphic to the drawing. 



3.1 Enumeration 

If we want to enumerate all the T(n) = n n ~ 2 trees on 
n nodes with a computer (for example with the Priifer 
coding), the problem becomes very hard as soon as n w 
10. It is faster to enumerate all the unlabeled (i.e non- 
isomorphic) trees. For each unlabeled tree T on n nodes, 
the spectrum (independent of labels on the nodes) is nu- 
merically computed. As T represents a class of isomorphic 
labeled trees, the spectrum is counted with a multiplicity 
equal to the number of ways of labeling T. i.e. n! divided 
by its symmetry factor. 

Let us call t{n) the number of unlabeled trees on n 
nodes. The sequence t(l), t(2), . . . , starts with 1, 1, 1, 
2, 3, 6, 11, 23, 47, 106, . . . Since works of Jordan, Cayley, 
Polya and Otter, we known [Knuth, 1997, p. 388] that 
t(n) grows exponentially like 



t(n) ~ 



TV 



5/2' 



(3) 



with a = 2.955765285652 . . . and = 0.5349496061 . . . 

To enumerate the t(n) unlabeled trees, we used a simpli- 
fied version of the WROM algorithm [Wright et al., 1986] 
with computation time 0(n 2 t(n)), and not 0(t(n)) as the 
original version. As the computation time for the spec- 
trum of one n x n matrix is C(n 3 ), and 0(n 2 ) or less for 
the symmetry factor in the worst case, we see that the 
slow part is not the enumeration work, but the computa- 
tion of the t(n) spectra. For the same reason, we did not 
find useful to work with algorithms like the recursive one 
of [Li and Ruskey, 1999] 

Thus, in this work, we have enumerated trees up to n = 
23, where i(23) = 14 828 074. As the computational time 
grows exponentially, it is difficult to enumerate spectra of 
trees with much more nodes. 



3.2 Priifer Monte-Carlo method 

To study large trees, the full enumeration is impossible. 
The Monte-Carlo method consists by randomly generat- 
ing a set of S independent trees, then measuring some 
properties and averaging them. Of course, the results have 
random noise, which can be estimated with usual formulae 
of statistics. In good cases, the noise decreases as 1/y/S. 

Main points are to generate trees with the right proba- 
bility law and with an efficient algorithm. For example, let 
us consider the naive method which generates a random 
graph by choosing n — 1 edges between n nodes, then re- 
jects this graph if not a tree. As trees are very rare among 
the set of graphs, this method is not efficient. 

If we modify the procedure by rejecting edges which 
close a cycle, we generate at each time a tree but now 
with a non uniform probability law. For example, for 
trees on 4 nodes, Prob(T4 i) = 1/15 and Prob(T4.2) = 
11/180 instead of 1/16. For n = 5, Prob(T 5 .i) = 1/105, 
Prob(T 5 , 2 ) = 127/15120 « 1/119 and Prob(T 5 , 3 ) = 
113/15120 w 1/134 instead of 1/125. We see that this 
procedure has a bias which favors trees with highly con- 
nected nodes. More seriously the bias grows exponentially 
with the size: the "star" tree (made of one central node 
connected to n — 1 peripheral nodes) has a probability 
l/(2n - 3)!! instead of l/n"- 2 . 

So we use the Priifer Monte-Carlo method, which is effi- 
cient and not biased. We first describe the Priifer coding, 
but without giving a proof. Let T be a labeled tree on 
n nodes. The Priifer coding consists in removing succes- 
sively one leaf at each step. We start with Ti = T. For 
1 < i < n — 1, the step i is the following: let bi be the leaf 
of Ti with the smallest label, let ai be the neighbor of bi 
and let T i+1 be the tree obtained by deleting from Tj the 
leaf bi and the edge {bi, c^}. The Priifer code of T is the 
sequence (a\, a,2, ■ • ■ , a n -2)- As a„_i = n necessarily, it is 
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not included in the code. 

Remark that the sequence (61, 62, . . . , b n -\) is always a 
permutation of (1, 2, ... , n — 1). In contrast, repetitions 
can occur among the code. More precisely, the number of 
neighbors of the node j is the number of j in the code, plus 
one. Consequently the leaves are the nodes which never 
appear in the code. 

To reverse this procedure, start with an arbitrary code 
(01, (I2, . . . , flSn-2) with 1 < dj < n. By convenience, we 
complete with a„_i = n. We built a tree T with the 
following iterative procedure: for 1 < i < n — 1, at step i, 
let bi be the least number in [1, n] not in (bi, 62, ... , bj-i) 
and not in (eij, . . . , a n -i)- The tree T is made of the n—1 
edges {fe^aj. 

Then the Priifer coding is an one-to-one correspondence 
between the labeled trees on n nodes and the n n ~ 2 codes 
(cii, 02, ■ • ■ , 0,71-2) with 1 < Oj < n. This proves in partic- 
ular that T(n) = n™" 2 . 

This coding gives an easy Monte-Carlo method: a ran- 
dom labeled tree on n nodes is built by choosing n — 2 
independent random integer numbers in [l,n] and by ap- 
plying the reverse procedure. Then the spectrum is com- 
puted. As usual with Monte-Carlo methods, the averages 
are done with as many independent trees as possible, in 
order to reduce the statistical noise. Of course the eigen- 
values are not independent: for example, each A 7^ is 
generated simultaneously with —A. So the estimators of 
variance must be calculated by considering that the inde- 
pendent events are the generated trees, then their whole 
spectra, but not the eigenvalues separately. 

As the computation time to generate a random code 
and then the corresponding tree is 0(n), it is possible to 
obtain trees on several millions nodes. But the slow part 
is always the computation of the spectrum which needs 
C(n 3 ) per tree (see details in Appendix A). This limits to 
several thousands nodes. 

We simulate different sizes to evaluate finite size effects. 
For each size n = 30, 50, 100, 200, 500, 1000, 2000, 5000, 
10000, we obtained m — 30.000.000 eigenvalues, i.e we 
simulated m/n random labeled trees of size n. The main 
cpu time consumption was for the larger size. We judged 
that it is not interesting to simulate larger trees (with 
the same global cpu time): m would be smaller and the 
fluctuations bigger. 

In principal we can simulate random unlabeled trees with 
the Priifer Monte-Carlo method. This procedure generates 
trees with uniform probability if they are labeled, but non 
uniform if they are considered as unlabeled. So we can 
correct this bias by giving a weight on each generated tree 
equal to its symmetry factor, inversely proportional to its 
probability. We have not simulated unlabeled trees, but 
it is probable that new problems appear when n is large: 
as very few trees with very large weights dominate the 
averages, the fluctuations are large and non-gaussian. This 
is similar to the simulation of the low temperature phase 



of an Ising model by generating random independent up 
or down spins, i.e. at infinite temperature. 

4 Spectral density of large random 
trees 

By definition, the spectral density (or density of states) of 
any tree T on n nodes with spectrum {Ai, A 2 , . . . A„} is 
the distribution 



Pt(x) 



1 " 



Ai) 



(4) 



i=l 



and the average spectral density of random trees on n 
nodes is 

1 -Hp^ x ) (5) 
where the sum runs over the n n ~ 2 trees on n nodes. The 



Pn{x) = 



n" 



bounds of the support of p n (x) are maxjA^} = \/n — 1 and 
min{Ai} = — yn-l, which are eigenvalues of the "star" 
tree, made of one central node and n—1 peripheral nodes. 
We are mainly interested by the asymptotic distribution 
p(x) when n is large, in the spirit of the statistical physics. 

4.1 Moments of the spectral density 

We consider pk,n — j dx x k p n (x), the fc'th moment of the 
average spectral density p n . It is related to the number of 
closed walks of length k on the trees on n nodes by 



n 



Pk.n 



= E TrT « 



(6) 



where the symbol T n represents at the same time a tree 
on n nodes and its adjacency matrix. 

As trees are bipartite, their spectra are symmetric with 
respect to zero and the odd moments P2k+i,n = 0. Fur- 
thermore, the first even moments p2k,n can be calculated 
by enumeration of walks. For k — 2, a closed walk of 
length 2 is just a "round trip" on an edge: as a tree 
has n—1 edges, Tr T 2 = 2(n — 1) for each tree, then 
P2, n = 2 — 2/n. The edges visited by a closed walk of 
length 2k on a tree form a subtree with I < k edges. By 
erasing the I visited edges, the rest (i.e the spectator edges) 
splits up into a "rooted forest" with n nodes and m = I + 1 
rooted trees. The number of such rooted forests is 

(n - IV 

TP — v n ~ m — — (7) 

(n — m)!(m — 1)! 

The number of closed walks of length 2k visiting all nodes 
of trees with / edges is (1 + 1)1 Jk : i where the integer num- 
bers Jk,i can be computed with a recursion relation: see 
Eq. 8 and Table 1 of [Bauer and Golinelli, 2001a, p 307]. 
Consequently, [Bauer, 2000] 
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valid for any n > 1 and the first ones are 







z 




= 2- 






n 
20 




= 8- 


h 




n 




= 40- 


176 


M6,n 


n 



IV' 



(9) 
(10) 
(11) 



The lar ge-77 limits of the pk.n are 



^(? + l)Jfc,i and M2fc+i-0. (12) 



z=i 



The /ifc are the moments of the asymptotic distribution 
p(x) of the spectral density for large random trees. Note 
that p(x) is defined without the need to scale it with a 
power of n. It is an "universal" distribution in the sense 
that it depends of no parameter, but only on the definition 
of the labeled trees. 

The sequence (p>2k)k>i starts [Bauer, 2000] with 2, 8, 40, 
226, 1384, 8992, 61212, 433136, etc. and we are not aware 
of any other problem involving the same sequence. It can 
be proved [Bauer and Golinelli, 2001a] that the growth 
of the coefficients Jk,i, and then of the p,2k, is sufficiently 
slow to characterize p(x), and sufficiently fast to assure 
that the support of p(x) is unbounded. 

4.2 Delta peaks and continuous distribu- 
tion 

It is not easy to extract accurate local information on the 
asymptotic spectral density p(x) from the knowledge of a 
finite number of moments. So we have recourse to numer- 
ical computations as described in Sect. 3. A histogram of 
Monte-Carlo eigenvalues for n = 10000 is shown on Fig. 1. 
As the spectra of trees are always symmetric with respect 
to zero, the histogram is folded up around zero for conve- 
nience sake. 

Only one choice of size n is represented on Fig. 1 be- 
cause the dependence on n is small. In order to give 
an idea of these finite size effects, we show on Fig. 2 
the (folded) cumulative density R n (x) = f* p n (y) dy for 
several choices on n. We have observed that R n (x) con- 
verges as 1/n; for example, the values of the maximal dis- 
tance D n — max x \R n {x) — i?ioooo(a ; )| are: Z? 2 3 ~ 0.021, 
L»50 ~ 0.0077, L>ioo « 0.0037 and L>i oo « 0.00042. So 
we can consider that n — 10000 is sufficiently large to ob- 
serve on Fig. 1 a distribution very close to the asymptotic 
distribution p(x). 

The main observation is that p{x) has several peaks with 
holes around them. These peaks correspond with exact 
degeneracies of eigenvalues: they are delta peak in the 
sense of distribution theory. Unfortunately, their heights 
on Fig. 1 arc not representative because they depend on 
the width of the histogram bin, chosen arbitrarily. On the 
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Figure 1: Average spectral density p n {x) of labeled trees 
on n = 10000 nodes as function of x, represented by a 
histogram of Monte-Carlo eigenvalues. The bin width is 
0.001. Because of its symmetry, the histogram is folded 
up around zero. 
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Figure 2: Cumulative spectral density of labeled trees on 
n = 23, 50 and 10000. The convergence is as 1/n. The 
vertical steps correspond with delta peaks. Because of its 
symmetry, the density is folded up around zero. 



other hand, they are regularly represented by the vertical 
steps on Fig. 2 but now only the main ones are visible. Nu- 
merical estimations of their heights are given in Sect. 5.2. 

The largest delta peaks are, in order of importance, at 
x = 0,±l,±\/2, etc. which are precisely the eigenvalues 
of the small trees listed in Table 1: T u T 2 , T 3 , etc. This 
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has been previously observed in several models of random 
graphs [Kirkpatrick and Eggarter, 1972; Evangclou, 1983; 
Farkas and al., 2001]. In Sect. 5.1 we show that the spec- 
tral density, in the large n limit, has an infinity of delta 
peaks at all eigenvalues of finite trees and that the corre- 
sponding eigenvectors are strictly localized, in the sense 
that their number of non-vanishing coordinates (i.e. the 
number of nodes on which the vector docs not vanish) is 
finite. 

Moreover we have noticed numerically that the recipro- 
cal is true: all the degeneracies (i.e. two or more equal 
values) among the complete set of Monte-Carlo eigenval- 
ues appear only at eigenvalues of finite trees and that the 
corresponding eigenvectors are strictly localized on a small 
number of nodes. In other words, the delta peaks appear 
at these special values and nowhere else. However the 
heights of peaks are exponentially small with the size of 
the corresponding finite subtree. Thus on Fig. 1 and 2, 
only the largest ones arc visible because the smallest ones 
are drowning into Monte-Carlo fluctuations. 

To summarize, our simulations indicate that the asymp- 
totic distribution p{x) has two components in the large n 
limit: a discrete component Pd{x) made of an infinity (but 
countable) of delta peaks at all eigenvalues of finite trees 
associated with strictly localized eigenvectors, and a con- 
tinuous component p c (x) built with all eigenvalues which 
are not eigenvalues of finite trees which form a continu- 
ous support. As explained in Sect. 5.2, the total weight of 
p c (x) is 0.80827± 0.00005. Consequently, the total weight 
of p d (x) is 0.19173 ±0.00005. 

Note that the spectral density of the trees on n nodes, 
for any n, is purely discrete because it is defined by a 
finite list of eigenvalues. So the continuous compo- 
nent p c {x) appears only in the large n limit. However it is 
possible to study p c {x) with Monte-Carlo finite trees with 
the following procedure. Firstly we enumerate all "small" 
trees on n < 23 nodes and their spectra are listed. Next, 
during Monte-Carlo simulations of much larger trees, we 
research if the generated eigenvalues are among the spec- 
tra of small trees. If they are, we regard them as part of 
delta peaks; otherwise, we regard them as part of p c {x). 

Of course, the threshold n = 23 is arbitrary and dic- 
tated by the power of our computer. But this threshold 
can be reduced up to n w 15 without significant changes 
because the delta peaks corresponding to trees on n > 15 
nodes are so small that they are not visible in our sim- 
ulations. As the heights of peaks decrease exponentially 
with n, the effective threshold increases as the logarithm of 
the computational time: then the value n = 23 is greatly 
sufficient. 

With the previous criterion, we show on Fig. 3 the dis- 
crete spectral density Pd{x) represented by the histogram 
of eigenvalues of Monte-Carlo trees on n = 10000 nodes 
which are also eigenvalues of small trees, drawn as delta 
peaks with now a correct height. As 30 x 10 6 eigenvalues 
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Figure 3: Delta peaks of the spectral density: eigenvalues 
of Monte-Carlo trees on n = 10000 nodes which are also 
eigenvalues of a small trees. Because of its symmetry, the 
histogram is folded up around zero. Note that the y-axis 
is logarithmic. 
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Figure 4: Continuous component p c (x) of the spectral den- 
sity: eigenvalues of Monte- Carlo trees onn — 10000 nodes 
which are not eigenvalue of a small tree. Because of its 
symmetry, the histogram is folded up around zero. 



are generated, only peaks with height bigger than 10~ 7 
are observed with sufficient statistics. Because of this nu- 
merical cutoff, the reader could believe that the support 
of Pd(x) has some gaps, but it is wrong because the eigen- 
values of finite trees are a dense (but countable) subset of 
real numbers. A proof is given in Appendix B. 
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The histogram of the other eigenvalues (i.e. which are 
not eigenvalues of small trees) is drawn on Fig. 4: therefore 
it represents the continuous component p c (x) of the spec- 
tral density. As expected, it looks like Fig. 1 but without 
peaks. In particular, p c (x) has holes at x — 0, ±l,±y/2, 
etc. which are positions of the delta peaks. By varying 
the width of the histogram bins, p c (x) appears to be zero 
and continuous at these special values, even if it is not 
really visible on Fig. 4. Around each special value, the 
spectral density concentrates to built a delta peak with a 
hnitc height and this concentration leaves a hole. This ef- 
fect is proportional to the height of the peak. Those holes 
remember that in quantum mechanics, an eigenvalue with 
a non localized eigenvector is repelled by the others. 

By generalization, we conjecture that any delta peak 
makes a hole in p c (x) proportional to its height. As delta 
peak positions are a dense (but countable) subset of the 
real numbers, p c (x) is a density function, non-continuous 
and non-zero almost everywhere except at these special 
values where it is continuous and zero. From a mathemat- 
ical point of view, note that such "pathological" functions 
exist, for example f(x) = cxp(— x 2 ) minj{2 4 (x — \i) 2 } 
where the (Aj)j>i are the special points. Remark that 
there is no contradiction with the word continuous: in the 
sense of the measure theory, p c (x) is a continuous distri- 
bution because its support is continuous or equivalently 
its integral p c (y) dy is a continuous function. On the 
contrary, the discrete distribution pd has a discrete sup- 
port and its integral has discontinuities (or jumps) for all 
delta peaks. 

A sceptical reader could object that p c (x) could have a 
singular continuous component, i.e. a measure everywhere 
zero excepts at a non-countable set with Lebesgue measure 
zero. A famous example on [0, 1] is the Cantor measure 
on the Cantor set. In this measure, the sum of the width 
of the gaps is 1 and all the measure is concentrated on the 
Cantor set. As this set has Lebesgue measure zero, the 
density is infinite on its points. But it is not a discrete 
distribution (with delta peaks) because this set is non- 
countable then the measure of any point remains always 
strictly zero. Equivalently, the integral is a continuous 
function. 

Our numerical analysis can not determine if p c (x) con- 
tains or not a singular part because Monte-Carlo fluctua- 
tions prevent from distinguishing between a singular mea- 
sure and a measure with a pathological density function. 
But we think that p c (x) vanishes only on the delta peaks 
positions, which are a set with Lebesgue measure zero, un- 
like the Cantor measure. Then the density remains finite 
everywhere, even if it is a pathological function. As we do 
not succeed to imagine a mechanism which concentrates 
the spectral density on a non-countable set with Lebesgue 
measure zero, we believe that p c (x) is not singular. 

To summarize this section, the spectral density of large 
random trees have an infinity of delta peaks at all eigen- 



values of finite trees, and a pathological density function 
vanishing at all peak positions. 

5 Delta peaks in the spectral den- 
sity 

5.1 Existence of delta peaks 

To show that a delta peak appears in the spectral density 
of large random trees at any eigenvalue of finite tree, we 
will adapt arguments of [Bauer and Golinelli, 2001a, p 
322] used for the same property in random graphs. First, 
we show that in a large tree on n nodes, the number of 
branches isomorphic to any finite tree T is proportional 
to n. Consequently delta peaks appear in the spectral 
density because the multiplicity of eigenvalues of T with 
eigenvectors strictly localized on branches is also 0(n) in 
large trees. 

We now introduce two definitions. A rooted tree (T, r) is 
a tree T where the node r is marked (the root) . A branch 
(B, r) at a node r in a tree T is a rooted subtree of T with 
no edges between nodes of B \ {r} and nodes of T \ B. 
Remark that it is not the usual definition. 

Let us consider any rooted tree (T, r) on m nodes. The 
number of branches isomorphic to (T, r) among all labeled 
trees on n nodes is 

B{n ^ = c I™' m {n m + l) ( "- m - 1) , (13) 
S [n — my. 

because the first factor counts the number of ways of la- 
beling the branch (5 is the symmetry factor of (T, r)), and 
the second factor counts the choices of the rest of the tree, 
i.e. (n — m) nodes, plus r. Consequently, a tree on n nodes 
has in average b(n) = B(n)/n n ~ 2 branches isomorphic to 
(T, r); in the large n limit, 

g-(m-l) 

b(n) — n . (14) 

For example, m — 2 gives n/e as the number of leaves 
in average in a large tree. Moreover, by considering more 
complex patterns with two branches, it can be proved that 
the variance of this number of branches per tree is 0(n) 
because contributions in 0{n 2 ) are canceled. Then this 
number is "self- averaging" : in the large n limit, for almost 
all trees, the number of branches isomorphic to (T, r) is 
ne -(m-i) j g p mg fluctuations. The amplitude of 

theses fluctuations has no simple formula because it de- 
pends not only on the size m but also on the shape of 
T. 

Let U be a tree with a branch (T, r) at a node r; let 
(A, V) be an eigenmode of T: if V r = 0, then A is eigenvalue 
of U. Indeed the vector W on U which extends V with 
coordinates zero outside T is also eigenvector with same 
eigenvalue A, independently of the size and the shape of 
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Figure 5: Construction of the tree D by duplication of the 
initial tree T (see text). As the eigenvector vanishes on 
r, D can be "grafted" on a larger tree (shaded area) for 
which the eigenvector is extended with coordinates zero. 

the rest of U. This shows that A can be found among 
spectra of large trees, with an eigenvector strictly localized 
on any branch (i.e. strictly zero outside) isomorphic to 
(T, r) . As the number of such branches in a large tree is 
0{n), the multiplicity of A is also 0(n): this gives a delta 
peak at A in the spectral density with strictly localized 
eigenvectors. 

Of course, this is true for any eigenvalue of T if the cor- 
responding eigenvector V vanishes on (at least) one node, 
by choosing this node as branching node r. In case all the 
coordinates Vi ^ 0, we transform T (with m nodes) into a 
larger tree D on 2m + 1 nodes, with the same eigenvalue 
A and now an eigenvector W with a coordinate W r = 0. 
As shown on Fig. 5, D is made of two copies T' and T" 
of T, plus one node r, plus two edges (r, r') and (r, r") 
where r' and r" are chosen among the nodes of X" and 
T" respectively (and not necessarily symmetric) . We note 
a' = V r > and a" = V r » (following the hypothesis, a' and 
a" ^ 0). Let W be the vector on the nodes of D where 
the restriction on T" is a"V, the restriction on T" is —a'V 
and W r — 0. As W r > + W r » = 0, the eigenvalue Eq. (2) is 
satisfied for r, then (A, W) is effectively an eigenmode of 
D, but with W r = 0. Now, we use the previous arguments 
with D in place of T to show that a delta peak appears at 
A in spectra of large trees. Of course this construction is 
possible but useless if the initial vector has a coordinate 
zero. 

Remark that this construction can be generalized by 
many ways. The trees T" and T" are not necessarily iso- 
morphic: the only constraint is that they must have A in 
common in their spectra. In particular, this construction 
can be iterated, by using D or U in place of T' or T". 
Moreover, the transformation can involve k > 2 trees and 
an additional node r connected to k nodes (rj)j=i ; fc (one 
per tree) with coordinates non-zero. The multiplicity of A 
is — 1 because the coefficients a, must satisfy the eigen- 
value condition on r: J2i a iVri = 0. 

The previous argument shows that the delta peaks do 
not vanish in the large n-limit but Eq. (14) gives only 
a coarse approximation of their height for many reasons. 
First, several finite trees with the same A contribute to the 
same peak (see Appendix C) and Eq. (14) shows that the 



main contributions come from trees with shape like D (see 
Fig. 5) and two components T" and T" with minimal size 
m. This contributions is s e~ 2m where s counts choices of 
T" and T" and choices of roots, without forgetting symme- 
try factors. Secondly, in the case where the construction 
of the above paragraphs is done with k > 3 trees, the mul- 
tiplicity of A is k — 1 but the tree D is counted k(k — l)/2 
times in Eq. (13). So we must use inclusion-exclusion for- 
mula: this gives a 0(e~ 3m ) correction. 

Because of these remarks, we do not know analytical 
formula for the heights of the delta peaks except at A = 
[Bauer and Golinelli, 2000] which corresponds to the tree 
on m = 1 node. However we keep the main result: in the 
large random tree limit, the multiplicity of any eigenvalue 
of finite tree is 0(n) and a delta peaks appears with a 
self-averaging height. 

5.2 Statistics of delta peaks 

In this section, we give numerical estimations for the 
heights of delta peak in the spectral density of large ran- 
dom trees. We have explained in previous sections that 
these peaks correspond to eigenvalues of finite trees. As 
their number increases exponentially with their size, it 
would be cumbersome to give a list of heights for each 
eigenvalue. So we prefer to group eigenvalues by order. 

Let A be an eigenvalue of a tree on n nodes. We define 
the order of A as the size d of the smallest tree(s) with 
eigenvalue A. Of course, d < n. We define Cd as the set 
of eigenvalues of order d: it is the finite set of eigenvalues 
of trees on d nodes but not in Cd' with d' < d. 

In principle, Cd can be explicitly written by enumerating 
all spectra of trees on d' < d nodes, as done on Table 1. 
As the spectrum of the trivial tree T\ on one node is {0}, 
then d = {0}. For d = 2, the tree T 2 gives C 2 = {-1, 1}. 
For d = 3, the spectrum of T 3 is {-■ v/2,0, v%- As is 
already of order 1, then £3 = {— \/2, \/2}. The following 
sets are £4 = ±{(V5± l)/2,\/3}, C 5 = ±{ V2~±\72, 2}, 
etc. Generally, as all spectra are symmetric with respect 
to zero, the Cd are also symmetric. 

An eigenvalue of order d is by definition a root of a 
characteristic polynomial, which is an integer polynomial 
of degree d. But this polynomial can often be factorized, 
for example if d is odd (because is eigenvalue by sym- 
metry) or if the tree has some symmetries [Cvetkovic et 
al., 1995, ch 4]. So an eigenvalue of order d is an algebraic 
number of degree d! with d! < d. It would be interesting 
to be able to determine the order of a given eigenvalue of a 
tree T without have to enumerate all the trees smaller than 
T. However, we have only a partial answer expounded in 
Appendix C. 

We define Td as the sum of the heights of delta peaks 
corresponding to eigenvalues of order d, in the large ran- 
dom tree limit. In other words, Td is the fraction of the 
spectrum of a large random tree occupied by eigenvalues 
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Table 2: The fraction Td of the spectrum of a random tree on n nodes occupied by eigenvalues of order d. The column 
n = 23 gives exact results. The columns n = 100, 1000 and 10000 give Monte-Carlo results; the one-sigma standard 
error (number inside parentheses) is given only for n = 10000 because it is on the same amplitude for the other n. 
The column n = oo gives extrapolations (see text). The row labeled "cc" is the complement, i.e. the weight of the 
continuous component in the spectral density. 



of order d. 

For d = 1 (i.e. A = 0) we know [Bauer and Go- 
linelli, 2000] that T\ — 2W - 1 = 0.134286..., where 
W = 0.567143 ... is the root of e~ x = x. However for 
d > 2, we have no analytic formula and numerical results 
are given in Table 2. Results for n = 23 arc obtained by 
exact enumeration, described in Sect. 3.1. For larger n we 
use the Priifer Monte-Carlo method described in Sect. 3.2 
by averaging over m = 30 x 10 6 eigenvalues for each size 
n. 

Moreover we know [Bauer and Golinelli, 2000] that 
T\ for a finite size n has an asymptotic expansion in 
powers of l/n. We observe numerically the same be- 
havior for d > 2. So we extrapolate for n = oo 
with least-square fits with linear or quadratic function of 
l/n. For these fits, we include Monte-Carlo results for 
n = 30, 50, 100, 200, 500, 1000, 2000, 5000 and 10000 and 
the standard errors for the extrapolations are smaller than 
the ones of the measures, as shown on Table 2. By simu- 
lating trees on n = 10001 nodes, we have also verified that 
the results do not show perceptible parity effects. 

We also observe numerically that, for a given d, the 
variance of Td for random trees on n nodes is propor- 
tional to l/n: in other words, the Td are self-averaging 
for large trees. The opposite would be very surprising be- 
cause the eigenvectors corresponding to eigenvalue of finite 
order are strictly localized on finite patterns: the appear- 
ances of these patterns look like independent events on a 
large random tree. A consequence is that the standard 
error of Td is 0(1/ y/m) (for a given d) where m is the 
total number of generated eigenvalues, independently of 
the size n. 

We are also interested by the localization of eigenvec- 
tors. For an eigenvector V corresponding to an eigenvalue 



of order d, let I be the number of coordinates (or nodes) 
where Vi ^ 0. We observed that most of them are local- 
ized on I — 2d nodes: they correspond to the construction 
shown on Fig. 5 with T" and T" as smallest as possible. 

As explained in Sect. 5.1 and Appendix C, eigenvectors 
with arbitrary large I can be built. But the probability of 
appearance of large patterns decreases exponentially with 
their size. So in our Monte-Carlo simulations, a few of 
eigenvectors have an extension I > 2d without ever exceed- 
ing Ad. In principle, with more important simulations, it 
would be possible to observe larger (and rarer) localized 
eigenvectors. 

In particular we do not remark a derealization phe- 
nomenon, as observed [Bauer and Golinelli 2001] in the 
Erdos-Rcnyi random graph model for eigenvectors with 
A = 0, when the average effective connectivity is between 
2.093 . . . and 3.312. . . Note that in the random tree model, 
the average connectivity is by definition fixed to 2. 

6 Conclusion 

In this work we have studied the spectral density of the 
adjacency matrix of random labeled trees, as a model of a 
hopping particle on a connected graph. In the limit where 
the number n of nodes is large, our main results are: 

• The averaged spectral density converges to an asymp- 
totic distribution p(x) with a 0(1 /n) behavior. More- 
over the spectral density is self-averaging, i.e. the 
spectral density of a given tree is p(x) almost surely. 

• At all eigenvalues of finite trees, p(x) has a delta peak. 
Its height decreases exponentially with the size of the 
corresponding finite tree. Except for the A = peak, 
we do not know how to calculate analytically these 
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heights, but numerical estimations are given. The 
total weight of these peaks is 0.19173 ± 0.00005. 

• The rest of p(x) is a distribution given by a density 
function which vanishes at each position of delta peak. 
As these positions form a dense set among real num- 
bers, this function is almost everywhere discontinu- 
ous. 

• The eigenvectors corresponding to eigenvalues of 
delta peaks are strictly localized, i.e. they vanish ev- 
erywhere except on a finite number of nodes. On the 
other hand, the other eigenvectors are not strictly lo- 
calized. 

It seems difficult to extend the analytical calculation 
[Bauer and Golinelli, 2000] of the height of the A = 
peak to all peaks because it is specific to this particular 
eigenvalue: there is only one "pure" tree for A = and 
moreover this tree is the trivial tree on one node. Then 
the enumeration of composite trees with A = is possible. 
In contrast, each other peak corresponds to an infinite 
family of pure trees, which are not trivial. 

We think that these results are shared by many models 
of graphs. For any eigenvalue of finite graph g with eigen- 
vector V , a delta peak appears in the spectral density of 
a large graph G on n nodes if the number of induced sub- 
graphs of G isomorphic to g (and connected to the rest 
of G only by nodes on which V vanishes) is 0(n). Of 
course the height of the delta peak depends on the details 
of the model. Generally, the highest ones correspond to 
the smallest g and their contribution gives an important 
fraction of the total weight of delta peaks. 

Furthermore the existence of a pathological density 
function for the continuous part of the spectral density 
is not specific to the random trees. As it corresponds to 
eigenvectors which are not strictly localized and orthogo- 
nal to eigenvectors of the delta peaks, it is expected that 
the density function has a depression around each delta 
peak. Moreover we think that these conclusions are also 
valid for other kind of Hamiltonian, for example the Lapla- 
cian, or graphs with weighted links. 

A Symmetry of the spectrum 

In this appendix, we show that the bipartition of any tree 
induces that its spectrum is symmetric with respect to 
zero. Moreover it allows to reduce the computation time 
of the spectrum by a factor 8. 

Let T be a tree on n nodes. The set of nodes can be 
partitioned into two subsets, P and Q, of sizes p and q 
(with p + q = n), so that all edges link a P node with a Q 
node. We consider the spectrum {\i)i = i, n of the symmet- 
ric adjacency matrix A of T . In this notation, degenerate 
eigenvalues correspond to several indices i. Note that all 
eigenvalues are real. First, without changing the spec- 
trum, wc permute the labels of the nodes of T so that P 



nodes are labeled by (1,2, ... ,p) and Q nodes are labeled 
by (p+l,p+2, . . . , n). In this basis, the rows and columns 
of A are permuted and A has now a (p + q) x (p + q) block 
shape 

R ) (15) 

where R is a rectangular q x p block and R T is the trans- 
pose of R. Then, 



R T R 
RR T 



(16) 



is block diagonal: its spectrum (A 2 )i = i jn is the union of 
the spectra of the square p x p block R T R and of the 
square q x q block RR T . Wc will show that it is sufficient 
to diagonalize only one block, R T R for example. 

Let V be an eigenvector of R T R (V is p-dimensional) 
with eigenvalue A 2 , then R T RV = X 2 V. We assume that 
A ^ 0. Then RV is a (g-dimcnsional) eigenvector of the 
other block RR T with the same eigenvalue A 2 , because 
RR T RV = R(X 2 V) = X 2 RV. The two n-dimensional 
vectors 



U± 



±xv 

RV 



(17) 



are eigenvectors of A with eigenvalues ±A because 



AU± 



R T RV 
±XRV 



X 2 V 
±XRV 



±XU± (18) 



If A 2 is a degenerate eigenvalue of R T R, this work can be 
done with any corresponding eigenvector. 

To resume, any eigenvalue A 2 > of one block of A 2 
is associated to an eigenvalue A 2 of the other block and a 
couple ±A of A. The rest is made of eigenvalues 0, with 
m(A) = m(R T R)+m(RR T ) = 2m(R T R) + q-p by noting 
m(-) its multiplicity. Then the spectrum of A is symmetric 
with respect to zero. Moreover eigenvalues (and eigenvec- 
tors) of A can be deduced from the diagonalization of only 
one block. 

From a numerical point of view, the complete diagonal- 
ization of a n x n full matrix needs a time C(n 3 ). As 
min(p, q) < n/2, it is at least 8 times faster to diagonalize 
the smallest block, RR T or R T R, instead of A. In the 
large n limit, p <~ q <~ n/2 for almost all trees, so the 
asymptotic factor is 8. 

Remark that the characteristic polynomial of A can by 
computed in 0(n 2 ) by using a recursion procedure with 
the characteristic polynomials of the subtrees obtained by 
deleting one node. However the computation of eigenval- 
ues from the knowledge of the characteristic polynomial is 
numerically very instable, so we prefer diagonalize A with 
Lapack routines for symmetric full matrix. 
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Figure 6: Top: a linear tree of length n. Bottom: a linear 
tree of length n decorated with bunches of k leaves. 

B Density of the eigenvalues of fi- 
nite trees 

In this appendix, we show that the set L of eigenvalues 
of finite trees is a dense subset of real numbers. Remark 
that by definition C is countable because finite trees are 
countable. We proceed in two steps: firstly the eigenvalues 
of linear trees are a dense subset of [—2, 2]. Secondly the 
eigenvalues of linear trees decorated with bunches of k 
leaves are a dense subset of ±[\/fc + 1 — 1, \Jk + 1 + 1]. As 
the union of all these intervals covers the real numbers, C 
is everywhere dense. 

For a linear tree of length n, pictured on Fig. 6, A is 
eigenvalue with eigenvector V — {vi}i = i. n if 

A Vi = Vi-i + v i+1 (i = 1,2, . . . ,n), (19) 

with the conventions vq = v n +i = 0. The n solutions are 



A„ = 2 cos I _EZ!_ 
v n + 1 



ipir 

v x . p = sin I — — 
\n + 1 



(20) 



for p — 1,2, ... ,n. Then the union of spectra of finite 
linear trees is dense in [-2,2]. 

Now we consider a linear tree of length n decorated with 
bunches of ft > 1 leaves: it is made of a linear backbone 
{ai}i=i, n , and for each i, k nodes {bij}j=i,k are connected 
to ai. See Fig. 6. Its size is (fc + l)n. 

For an eigenvalue A, we note ai and bi_j the coordinates 
of the eigenvector on the respective nodes. The eigenvalue 
equations are 

Xb itj = a, (i = l,2,...,n; j = l,2,...,fc) (21) 

k 

X ai = ai-! + a i+1 +^bij (i = 1, 2, . . . , n) (22) 
i=i 

with the conventions ao = a n +i =0. If A = 0, then 

fe 



2 b * * 

3 = 1 



a i 



(i = l,2,...,n), 



(23) 



Figure 7: Top: a linear tree of length n decorated with 
isomorphic trees U. Bottom: decomposition of U . 



and the multiplicity of the eigenvalue is {k — l)n. For 
A^O, 

h,j=ai/X, (X- k/X) ai = at-i + a i+ i. (24) 

Then the equation for [i = A — k/X is similar to Eq. (19). 
There are 2n non-zero eigenvalues given by 

= \{n P ±^jnl + Ak), (25) 

(p=l,2,...,n). (26) 



2 cos 



pit 
n + 1 



As the set of \i v for n > 1 is dense in [—2, 2], then the set 
of Ap + ^ is dense in [\/fc + 1 — 1, \/k + 1 + 1] and the set of 
A^r' is dense in [~Vk+ 1 - 1, -\/k + l + 1]. 

The union of these intervals for k > 1, plus the interval 
[—2, 2] given by linear trees, covers all the real numbers. 
So the trees described above are sufficient to prove that 
eigenvalues of finite trees are a dense subset of real num- 
bers. 

We remark that the same calculation can be done with 
other decoration patterns. Let us consider now a tree with 
a linear backbone {ai}i = i t „, and for each i, a rooted tree 
(U, r) is grafted by merging r and pictured on Fig 7. 

Then, Eq. (24) is generalized by 



/(A) aj = ai-i + a i+ i, 
where the function f(x) is defined by 



fix) 



Xu(x) 



(27) 



(28) 



by noting xt{x) the characteristic polynomial of the ad- 
jacency matrix of a tree T, and Ui, U2, ■ ■ ■ U s the subtrees 
of U obtained by removing its root r (see Fig 7). The 
eigenvalues are the set of / _1 (^ p ), solutions of f(x) = [i v 
and they are dense in 2, 2]). 

That remembers the band theory in solid state physics. 
We look at U as an atom: an eigenvalue / _1 (0), solution 
of xu{ x ) — 0, is an energy level. The linear tree decorated 
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with U is like a crystal, and the terms a,i±\ in Eq. (27) 
mean that electrons can hope from an atom to its neigh- 
bors. Then each atomic level / _1 (0) is stretched and forms 
a band / _1 (2 cos(fc)) where k is the wave vector. 

C Large pure trees 

For any finite tree T with eigenvalue A, we have seen in 
Sect. 5.1 that it is possible to build arbitrarily large trees 
with same eigenvalue A by connecting many copies of T 
with additional nodes with coordinates zero. In this Ap- 
pendix, we will show that arbitrarily large trees with eigen- 
vectors without coordinates zero can be obtained as well. 

Let T be any tree with eigenvalue A and eigenvector V. 
If all coordinates Vi + 1 , we say that T is X-pure. If T 
is A-pure, it can be proved that A is not degenerate in the 
spectrum of T. Then the A-purity is really a property of 
the tree, and it does not depend on a particular choice 
of eigenvector. By noting X m the largest eigenvalue of T, 
remark that T is A m -pure because the Perron-Frobenius 
theorem assures that all coordinates of the corresponding 
eigenvector are all positive. 

If T is not A-pure, it is composite. By deleting any 
node i with Vj — 0, T is split into a forest of one or more 
subtree(s); on each subtree, the restriction of V is again an 
eigenvector with eigenvalue A. Then by deleting all nodes 
with coordinates 0, the rest is a forest of A-pure subtrees. 
A consequence is that the order of A is less or equal to the 
size of the smallest pure subtree. 

Naturally we wish enumerate A-pure trees. The case 
A = can be easily solve: for any leaf i, its neighbor j has 
Vj = XVi = 0. So there is only one 0-pure tree: the trivial 
tree on one node. On the contrary, if A + 1 0, we will show 
that there is an infinity of A-pure trees. 

Many processes can built an arbitrary large A-pure tree 
U by using several A-pure trees Ti (not necessarily isomor- 
phic) and additional nodes. Some processes are general: 
they apply to all trees Ti. But other processes are special- 
ized to trees with special patterns or special A (for example 
integer A). There is an infinity of kind of processes and 
we have not found a unified way to present them. So we 
content ourself to give some examples. 

As it is cumbersome to explain the details with words, 
we show graphical representations of the resulting tree U 
with its eigenvector W by using the following rules. A 
triangle represents a A-pure tree Ti and the restriction of 
W on Ti is the eigenvector of Tj with eigenvalue A. When 
a node r is labeled by x, it means that W r = x. If r is 
a vertex of a triangle Tj, the eigenvector of Tj is scaled 
in order that W r = x. Remark that it is always possible 
because Tj is pure, i.e. without coordinate zero. When a 
node r is a common vertex of triangles Tj and Tj , it means 
that a node ?-j of Ti is merged with a node rj of Tj . The 
neighbors of r are the ones of , plus the ones of rj . Of 
course, all these processes are done in order that Eq. (2) 




b patterns 

Figure 8: Two processes to built a large "pure" tree with 
any eigenvalue of finite tree and an eigenvector without 
coordinate zero. Left: the "2+1" process. Right: the 
"b+l,b" process. 

Figure 9: A process to built a larger "pure" tree with two 
pure trees by merging two couples of leaves. 

is satisfied for W on every nodes. 

The simplest general process, called "2+1", is shown 
on Fig. 8. It uses three A-pure trees Tn 2,3} with one 
merge and one additional node. So Size([/) = J2 i Size(Tj). 
Remark that U is pure because A ^ 0. The reader can 
verify that Eq. (2) is satisfied on every nodes of U. For 
example, let us consider the common node r between T\ 
and T 2 with W r = 1: the neighbors j\ of r inside the 
tree T\ give Wj 1 = XW r , because the restriction of W 
on Ti satisfies Eq. (2). And the same for the neighbors 
J2 inside the tree T 2 . As r is also linked with a node of 
T3 with coordinate —A, then all neighbors j of r gives 
J2j Wj — XW r ■ This process can be iterated by using U 
in place of Tj to obtain arbitrary large A-pure trees. 

A generalization is the "b+l,b" process described on 
Fig. 8, with 6+1 pure trees with a common node linked 
to b patterns made of one pure tree and an additional 
node. Here again Sizc([7) = ^\ Size(Tj). 

It is easy to imagine other processes of this kind with 
more complex "bridges" between pure trees. But we pre- 
fer to describe on Fig. 9 a process which exploits a specific 
pattern. It uses two pure trees with a couple of leaves 
linked to the same node: U is built by merging these four 
leaves into a single node with a doubled coordinate. Then 
Size(£7) = Size(Ti) + Size(T 2 ) — 3. This process can be gen- 
eralized with k trees with bunches of k leaves, by merging 
the k 2 leaves into a single node with a coordinate multi- 
plied by k. Once again, more complex processes can be 
imagined. 

To calculate analytically the height of delta peaks in 
the spectral density, a method would be to enumerate the 
trees with eigenvalue A by first enumerating pure trees, 
and then by counting all combinations of pure trees into 
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composite trees. Unfortunately, these few examples above 
show that there is a great variety of pure trees, and we 
have no method to do a full listing. The only case [Bauer 
and Golinelli, 2000] for which analytical results are know 
is for A = thanks to two facts: this is only one 0-pure tree 
which has one node, and then the count of combinations 
is done by a method specific to this trivial tree. 
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